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The mosquito Aedes aegypti is the principal vector for the yellow fever and dengue viruses, and is also 
responsible for recent outbreaks of the alphavirus chikungunya. Vector control strategies utilizing 
engineered gene drive systems are being developed as a means of replacing wild, pathogen transmitting 
mosquitoes with individuals refractory to disease transmission, or bringing about population suppression. 
Several of these systems, including Medea, UD MEL , and site-specific nucleases, which can be used to drive 
genes into populations or bring about population suppression, utilize transcriptional regulatory elements 
that drive germline-specific expression. Here we report the identification of multiple regulatory elements 
able to drive gene expression specifically in the female germline, or in the male and female germline, in the 
mosquito Aedes aegypti. These elements can also be used as tools with which to probe the roles of specific 
genes in germline function and in the early embryo, through overexpression or RNA interference. 

Aedes aegypti is the major vector for yellow fever and dengue viruses 1,2 . About 2.5 billion people are at 
risk for dengue, with 50-100 million infections and —25,000 deaths each year 3 . No vaccine is available, 
leaving vector control the only option for prevention. The emergence and spread of insecticide resistance 
poses a threat to the sustainability of these efforts 4 . Vaccines are available for yellow fever, but there are still 
—200,000 cases each year, resulting in —30,000 deaths 1 . As a result, there has long been interest in the develop- 
ment of novel genetic strategies for Aedes aegypti population management. 

Two general strategies have been envisioned, in a variety of implementations: population suppression, and 
population replacement with individuals refractory to disease transmission 512 . Several versions of these strategies 
either work well with, or require the use of regulatory sequences that drive expression primarily or exclusively in 
the female germline. For example, synthetic Medea and UD MEL elements, which have been shown to drive 
population replacement in Drosophila 1315 , require the use of a maternal germline-specific promoter. A Medea 
selfish genetic element consists of two linked genes: a toxin that is expressed only in the female germline, with 
effects that are passed to all progeny, and a neutralizing antidote, expressed under the control of an early zygote- 
specific promoter. Medea spreads because when present in females, it causes the death of all progeny that fail to 
inherit it, thereby promoting its spread at the expense of homologous chromosomes that lack it 1619 . In the UD MEL 
system, which has a higher introduction threshold, and is therefore more easily recalled from the population, 
similar components are used, but they are located on different chromosomes 14 . Another, proposed strategy for 
bringing about population replacement, utilizes engineered site-specific DNA endonucleases, often referred to as 
homing endonucleases (HEGs). Here the idea is that transgenics can be created that carry an engineered HEG 
located at the same position in the genome as its target site. If expression of the HEG is activated in the male and/ 
or female germline in a heterozygote, cleavage of the target site on the wildtype chromosome followed by DNA 
repair that utilizes the HEG as a template can result in an increase in the frequency of the HEG in the population. 
If the insertion site is located in a gene required in somatic tissues for female viability or fertility, this spread is 
predicted to result in the population experiencing a fitness cost, which can result in population suppression. If the 
population also carries an unlinked construct containing a fitness cost-rescuing transgene and one or more genes 
conferring disease refractoriness, it is predicted that the latter construct will spread as the HEG makes its way 
through the population 6,20 . Engineered site-specific nucleases expressed using germline promoters are copied into 
artificially inserted target sites in both Drosophila and Anopheles mosquitoes, and rapidly spread into populations 
engineered to carry those target sites, providing support for this approach 21,22 . 

In Aedes aegypti, a number of regulatory elements able to drive gene expression in specific tissues have been 
identified through transgenesis. These include the midgut 23,24 , the eye 25 , salivary glands 26,27 , testis 28 , fat body 29 " 31 , 
female flight muscle 32 , and ubiquitous expression 33,34 . Several regulatory elements have also been identified that 
drive expression in the female germline. However, the expression pattern of these is either weak or not specific to 
the germline cells (Supplementary Figure 1) 33,35 ~ 3 ' J . 
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Figure 1 | RNAseq expression analysis. RPKM expression values for AAEL007097 (A), AAEL007584 (C), AAEL010097 (E), and AAEL000923 (G) were 
plotted across development. Samples include, from left to right: testis; male carcasses (lacking testes); carcasses of females prior to blood feeding (NBF), 
and at 12 hr, 24 hr, 36 hr, 48 hr, and 72 hr; ovaries from NBF females and at 12 hr, 24 hr, 36 hr, 48 hr, and 72 hr; embryos from 0-2 hrs through 
72-76 hr; whole larvae from 1st instar, 2nd instar, 3rd instar and 4th instar; male pupae; female pupae. Genome browser snap shots were extracted from 
the genome browser at www.vector.caltech.edu for germline samples including both ovaries and testes (B,D,F,H,). Red box indicates coding 
sequence; green box indicates the annotated UTRs; yellow box depicts the putative regulatory regions chosen; and purple box indicates the SV40 UTR. 
Y axis in panels (A,C,E,G) indicates normalized expression levels in RPKM; the Y axis in the genome browser snapshots shows expression level based on 
raw read counts. 



As a first step towards identifying new transcriptional regulatory 
elements active in the female germline we recently carried out a 
comprehensive sequencing of the Aedes aegypti developmental tran- 
scriptome, incorporating 26 whole animal samples at different time 
points and 16 adult samples representing gonads or carcasses from 
which gonads had been removed 39 . Of particular relevance for the 
current study, a number of sex-specific samples were collected. These 
included sex separated male and female pupae as well as 16 addi- 
tional samples from adults. This latter set included dissected whole 
ovaries from non-blood fed females (NBF) and from females at 
12 hr, 24 hr, 36 hr, 48 hr, 60 hr, and 72 hr post-blood-meal 
(PBM); carcass samples (whole female bodies lacking ovaries) were 
also collected from these same time points. These samples cover 
ovarian and follicle development from previtellogenic "resting stage" 
NBF ovaries through the completion of oogenesis at 72 hr-PBM. 
Samples were also isolated from adult male testes and accessory 
glands (AG) as a single sample and male carcasses (lacking testes 
and AG) at four days post eclosion. Analysis of this data set identified 
a modest number of genes predominantly expressed in the ovary and 
early embryo, as would be expected for genes that are expressed in the 
maternal germline, with their transcripts being deposited into the 
oocyte. Transcripts whose expression subsequently trended down- 



wards during early embryogenesis were of particular interest as this 
would be consistent with little or no zygotic expression in the 
embryo. 

Here we report transgenesis-based analysis of regulatory regions 
surrounding four of these genes. For three genes, AAEL007097, 
AAEL007584, and AAEL000923, we were able to identify sequences 
that drive strong-maternal-specific expression in the ovary. For a 
fourth gene, AAEL010097, sequences able to drive female- and 
male-germline specific expression were identified. 

Results 

Identification and selection of candidate germline genes in Aedes 

aegypti. We selected four genes, AAEL007097, AAEL0 10097, AAEL- 
000923, and AAEL007584, for analysis because they were expressed 
at high levels, and predominantly in the ovary (Figure 1 A, C, E, G). 
As a point of comparison, the Nanos (AAEL012107) 35 and Vitello- 
genin receptor (AAEL007657) 36 genes, whose regulatory elements 
were characterized previously, are expressed at peak levels (RPKM) 
of 148 and 92 in the ovary, respectively. The genes characterized here 
have peak ovarian expression levels of 14,521, AAEL007097; 2,838, 
AAEL010097; 1,341, AAEL000923; and 3,420, AAEL007584 (see 
supplementary Figure 1 and Figure 1). 
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AAEL007097 is part of a genomic locus containing four closely 
related genes whose proteins are likely 4-nitrophenylphosphatases. 
Of these, AAEL007097 is the most highly expressed, and is expressed 
predominantly in the female germline. Transcript levels peak in 
the pre-vitellogenic (Non-Blood Fed: NBF) ovary (Figure 1A) and 
remain high (avg. 9359 RPKM) through ovarian development. Upon 
egg laying transcript levels decrease drastically and do not begin 
increasing until larval stages, presumably reflecting expression in 
the developing germline. AAEL007097 is also expressed in the male 
germline, but at very low levels (~2 RPKM). The other three nearby 
related genes (AAEL007090, AAEL007094 and AAEL007098) are 
very weakly expressed in the germline (<20 RPKM), based on our 
RNAseq data (Supplementary Figure l) 39 . AAEL007090 is expressed 
mainly in the carcasses of both male and females and AAEL007094 
and AAEL007098 are both expressed in mid-early embryogenesis 
(8-12 hrs)(Supplementary Figure 1). 

AAEL007584 is the 1 : 1 Aedes aegypti orthologue of the 
Drosophila trunk gene, whose product is required for embryonic 
anterior-posterior polarity. In Drosophila, Trunk protein is synthe- 
sized in nurse cells and transferred to the mature oocyte. Trunk is 
thought to be the ligand of the torso receptor 40 , which is responsible 
for the activation in the anterior and posterior ends of the embryo of 
the terminal gap genes tailless and huckelbein. Interestingly, Aedes 
aegypti has two torso-like genes; one that is most abundantly 
expressed in the ovary as with the ancestral Drosophila gene 
(AAEL002404), and a second (AAEL010923), arising from a recent 
duplication, that is exclusively expressed in early embryos. The 
Drosophila torso gene has three isoforms generated by alternative 
splicing, one of which, Tor-RC, is exclusively produced in early 
embryos 41 . Together, these observations suggest that while both spe- 
cies generate maternal and zygotic versions of the Torso protein, 
Drosophila achieves this by alternative splicing, while Aedes utilizes 
gene duplication. AAEL007584 (Aa-trk) is maximally expressed in 
pre-vitellogenic (NBF) ovaries. Upon blood feeding, its expression 
decreases dramatically (10 fold), but remains above background (avg. 
361 RPKM) for the remainder of oogenesis. A slight increase in 
expression is observed during early stages of embryogenesis. No 
expression above background is observed in the male germline. 

AAEL010097 (Aa-exu) is the Aedes 1 : 1 orthologue of the 
Drosophila exuperantia gene, whose product is best known for its 
role in bicoid mRNA localization during oogenesis and in the early 
embryo 42 . AAEL010097 is expressed at high levels in the ovary (up to 
-2800 RPKM), and at much lower levels (<3 RPKM) in testes. In 
females AAELO 10097 expression is low in the NBF ovary. High-level 
expression occurs following a blood meal, and expression remains 
high throughout oogenesis. AAEL010097 is also abundant in freshly 
laid embryos (0-2 hs), presumably reflecting maternal transcript 
deposition into the oocyte, since zygotic expression is not thought 
to occur at this early stage. 

AAEL000923 encodes a protein predicted to have multiple EGF-like 
repeats. Blast searches identify related proteins in other insects, but 
their functions have not been determined. AAEL000923 expression 
is female germline-specific. Expression is relatively high throughout 
oogenesis, peaking at 72 hrs PBM (1,341 RPKM). AAEL000923 is also 
abundant in early embryos (0-2 hrs), after which transcript levels fall 
continuously such that by 12-16 hrs transcripts are rare (<29 RPKM). 
Examination of Modencode and Mozatlas data sets shows that both 
Drosophila (CGI 1674) and Anopheles (AGAP010789) orthologs of this 
gene are expressed in a similar female germline-specific manner (data 
not shown). 

Generation of transformation constructs and transgenic lines. We 

used RNA-seq data to identify likely transcription start sites for the 
above genes (Figure IB, D, F, H). For each gene we isolated a 
fragment of genomic DNA that extended from 2-3 kb 5' to the 
transcription start site, through the 5' UTR, to the translation start 



codon. For gene AAEL010097 we also included a 1075 bp fragment 
downstream of the stop codon, including the entire 3 'UTR and 
flanking genomic DNA. For AAEL007097, AAEL007584, and 
AAEL000923 we utilized the SV40 3'UTR (Figure IB, D, F, H). 
Finally, we also isolated regulatory sequences from the An. gambiae 
vasa gene shown previously to be sufficient to drive expression in the 
male and female germlines 43 . Each of these fragments was cloned into 
pMos 3xP3-eGFP 25 , and three to four independent transgenic lines 
were isolated for each promoter tested (see methods for details). In 
the sections below we describe the expression patterns of represen- 
tative transgenic strains for each gene. 

Transcriptional regulatory element expression pattern, as visua- 
lized with eGFP. To determine transgene expression pattern in the 
female germline we examined eGFP expression in ovaries from hete- 
rozygous females at several different stages of development: the pre- 
vitelogenic non blood fed ovary (Figure 2A); the 24 hr PBM ovary, 
which allows visualization of the germarium, secondary follicles, 
nurse cells of the primary follicle, the oocyte and the surrounding 
somatic follicle cells (Figure 2B); follicles from 24 hr PBM ovaries, 
which allow visualization of the germarium, the degenerating nurse 
cells, somatic follicle cells, and the mature oocyte (Figure 2C). The 
testes of transgenic males were also examined (Figure 2D). 

Each adult Aedes aegypti ovary consists of approximately sixty 
ovarioles (Figure 2A). Each ovariole contains a maturing follicle, a 
secondary presumptive follicle, and a germarium containing both 
somatic and germline stem cells (Figure 2B, schematic). Each follicle 
is comprised of seven germline nurse cells and one oocyte, sur- 
rounded by a somatically derived follicular epithelium. High levels 
of eGFP are present in the pre-vitelogenic follicles from AAEL007097, 
AAELO 10097, AAEL000923 and AAEL007584 transgenics. Expres- 
sion is particularly high in follicles from AAEL010097 (exu) trans- 
genics. eGFP is also readily apparent in the secondary follicles and 
germaria from AAEL007584 (trunk) transgenics (Figure 2A, inset). 
Modest variegation in eGFP expression is apparent in pre-vitelogenic 
follicles from AAEL007097, AAEL010097, and AAEL000923 trans- 
genics. Variegated expression is quite strong in transgenics carrying 
regulatory sequences from Anopheles vasa, with many pre-vitelogenic 
follicles appearing to lack expression entirely. Interestingly, however, 
by 24 hrs, all maturing follicles within an ovary, for all four trans- 
genes, show comparable levels of eGFP expression (Figure 2B). 
Maturing follicles from transgenics containing the AAEL010097 
regulatory region show the highest levels of eGFP; eGFP can also 
be seen in the secondary follicle, but not the germarium. At 24 hrs 
eGFP is also present in secondary follicles of AAEL0070974, 
AAEL007584 and AAEL000923 ovaries, but not in those from 
Anopheles vasa. High levels of eGFP are also observed in the germar- 
ium of trunk transgenics at 24 hrs, but not in the germarium of other 
transgenics. Importantly, eGFP expression was not observed in 
somatic follicle cells at any stage, for any of the transgenics, indicating 
that, at least in the ovary, these regulatory elements drive germline- 
specific expression. 

At 72 hrs the nurse cells have dumped their contents, including 
eGFP, into the mature oocyte, and are in the process of degeneration. 
To get a rough comparative estimate of the total amount of maternal 
eGFP expression we imaged the same oocytes using exposure levels 
ideally suited for each individual oocyte (Figure 2 C, left panel), or at 
a constant exposure (Figure 2 C, right panel). This analysis showed 
that the regulatory regions of Aa-exu (AAEL010097) drive express- 
ion of the highest levels of maternal eGFP, but significant levels of 
eGFP were also observed in all other transgenics. 

Since our developmental transcriptome data suggested that several 
of these genes, AAEL007097, AAEL010097 and AAEL00923, are also 
expressed at low levels in the male germline (between 2-11 RPKM), 
we also dissected and imaged transgenic testes. Only the regulatory 
region from AAELO 10097 drove detectable levels of eGFP in the 
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Figure 2 | Fluorescent analysis of regulatory regions. Previtelogenic NBF ovaries were imaged for each promoteneGFP transgenic line to capture the 
overall expression pattern of follicles within an ovary (A top panel). The inset shows expression of the same ovaries at a higher magnification (A bottom 
panel). Individual ovarian follicles were imaged a 24 hr post blood meal (PBM) (B) and 72 hr PBM (C left panel) for each regulatory element. At 72 hr 
PBM a constant exposure was also taken of each oocyte (C right panel) so as to be able to compare relative levels of eGFP expression for different 
constructs. eGFP expression is also observed in the testes of AAEL010097 (D). Terminology: FE- follicular epithelium, NC- nurse cell, 2°F -secondary 
follicle, G -germarium, HC -hub cells, GSC -germ stem cells, SSC - somatic stem cells, MC -maturing cyst, and VE -vas efferens. 



testis (Figure 2D), with expression apparent from the germ stem cells 
onwards, including in mature sperm located in the spermatophore. 
Interestingly, the Ag-vasa regulatory elements did not drive express- 
ion of detectable eGFP in the Aedes testis, suggesting that the enhan- 
cers required for male germline expression of vasa have evolved 
significantly. 

Quantitative RT-PCR to assess germline specificity of regulatory 
regions. As another approach to determining transgene expression 
pattern and levels of expression we performed quantitative RT-PCR 
of dissected tissues from each of the lines described above using 
reporter specific primers (Figure 3). Expression was analyzed in 
the NBF ovary, NBF female carcass (lacking ovary), 24 hr PBM 
ovary, 24 hr PBM carcass (lacking ovary), testis, and male carcass 
(lacking testis) (Figure 3). Transgenics carrying regulatory regions 



from AAEL000923 showed the highest level of expression in the NBF 
ovary followed by AAEL010097, AAEL007584, AAEL007097, and 
Ano-vasa. All transgenics showed only very low-level expression in 
the NBF female carcass. At 24 hr PBM, AAELO 10097 transgenics 
had the highest level of ovarian expression, followed by AAEL- 
000923, AAEL007097, AAEL007584, and Ano-vasa. Again, only 
low-level background expression was observed in the 24 hr-BF- 
carcass for each construct. Only very low-level expression was 
observed in testes for all lines, consistent with the observations 
made using the eGFP reporter. 

Discussion 

To thoroughly understand mosquito biology, and to develop genetic 
means to bring about vector control, we need to be able to manipulate 
gene expression in specific tissues at specific times in development. 
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Figure 3 | Germline specificity of regulatory regions. Quantitative RT PCR for each PromoteneGFP transgenic line measuring relative expression in the 
NBF ovary (red), NBF female carcass (dark green), 24 hr ovary (pink), 24 hr female carcass (green), testis (blue), and male carcass (light blue). Y axis is 
relative expression level. Error bars show standard error. To determine statistical significance, P-values were calculated between germline tissues and 
respective carcasses (brackets). P < 0.05 is indicated with one asterisk, p < 0.005 is indicated with two asterisks, p < 0.0005 is indicated with three 
asterisks. 



This work is facilitated through the use of bi-partite expression 
systems such as the GAL4-UAS system, recently introduced into 
the mosquito 44-46 . It also requires the use of tissue- and stage- 
specific transcriptional regulatory sequences. Sequences able to drive 
maternal-germline-specific expression are interesting for several rea- 
sons. First, as discussed in the introduction, several different genetic 
strategies for population suppression and replacement utilize mater- 
nal-germline-specific gene expression. Second, because many tran- 
scripts and proteins used during early embryogenesis derive from 
maternally-loaded products, maternal germline-specific promoters 
provide tools by which development of the early embryo can be 
manipulated through overexpression or RNA interference. Specific 
expression in the ovary avoids potential problems with lethality 
associated with expression manipulation using promoters expressed 
more broadly. It is also much easier than injecting products directly 
into early embryos, and likely to give greater overall reproducibility 
as compared with gene expression manipulated through intra- 
abdominal injection of DNA/RNA transfection reagents 47 . 

Here we describe the identification of transcriptional regulatory 
regions able to drive female germline-specific, and female and male 
germline-specific expression in Aedes aegypti. The regulatory 
sequences identified drive overlapping but somewhat distinct pat- 
terns of expression. Sequences from AAEL007584 drive female 
germline-specific expression during all stages of oogenesis: in the 
maturing follicle, and in the secondary follicle and the germarium. 
Therefore this regulatory element will be useful in contexts where the 
goal is to drive expression from stem cell stages onwards. That said, 
expression driven by regulatory sequences from AAEL007584, as 
determined by RT-PCR, is not as strong as that driven by sequences 
from the three other Aedes genes at 24 hr and 72 hr time 
points. Regulatory elements from AAEL010097, AAEL007097 and 
AAEL000923 drive strong eGFP expression in maturing and second- 
ary follicles. Therefore, these sequences will be useful when the goal is 
to influence the developing follicles but not the stem cell niche. 
Sequences from AAEL007097, AAEL000923 and AAEL007584 also 



show little or no eGFP expression in the male germline, which makes 
them useful when the goal is to drive gene expression only in the 
female germline. Sequences from AAEL0 10097 drove the highest 
levels of expression in all maternal germline stages other than the 
germarium, where no expression was seen. AAEL010097 sequences 
also drove low levels of expression in the male germline. Therefore, 
provided that low levels of testes expression can be tolerated, 
AAEL010097 sequences will be useful when high levels of gene 
expression are desired. Sequences from Anopheles vasa drive rela- 
tively low-level female germline-specific expression in maturing and 
secondary follicles. 

A common feature of expression driven by sequences from all 
genes other than AAEL007584 was some degree of eGFP expression 
variegation in follicles of the NBF ovary. This may result from posi- 
tion-effect variegation associated with specific insertion sites in the 
genome. However, we think this unlikely because it was observed in 
multiple independent transformants. It is also possible that varie- 
gated expression in the NBF ovary is reflective of epigenetic instab- 
ility of gene expression, as has been observed during early stages of 
Drosophila ovarian follicle differentiation 48 . In any case, regardless of 
the mechanism, expression within follicles of an ovary becomes con- 
sistent by 24 hrs PBM. 

The expression pattern driven by sequences from AAEL007584 
shows a good correspondence with the endogenous expression pat- 
tern, with high-level endogenous gene and eGFP expression being 
observed in the NBF. Good correspondence between endogenous 
transcript levels and eGFP expression in the NBF ovary is also seen 
for AAEL007097. However, in the case of AAEL010097 and 
AAEL000923 low levels of endogenous transcripts are present in 
the NBF ovary, while robust levels of eGFP and transgene mRNA 
is observed in the NBF ovary. This discordance is particularly appar- 
ent in the case of AAEL010097: the endogenous gene is expressed at 
low levels in the NBF ovary, but the transgene is expressed at quite 
high levels. In addition, while AAEL007097 is one of the most abun- 
dantly expressed genes throughout oogenesis 39 , with transcript levels 
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between 3-10-fold higher than those of the other genes tested, eGFP 
levels were comparatively low, while transgene mRNA levels were 
relatively high, though not higher than those from transgenes 
driven by sequences from AAEL010097. These results suggest that 
regulatory elements from AAEL010097 and AAEL000923 needed to 
repress transcription in the NBF ovary are missing from our con- 
structs. In contrast, our constructs may be missing sequences from 
the AAEL007097 locus needed to drive early and high levels of ova- 
rian expression. The remarkably strong eGFP expression seen in 
AAELO 10097 transgenics could also reflect the activity of sequences 
in the 5' UTR that enhance translation as compared with 5'UTR 
sequences present in other constructs. 

This analysis constitutes just a beginning of the regulatory 
sequence analysis that can be carried out using the Aedes devel- 
opmental transcriptome. Stringent screening of this data set, and 
others being generated in Aedes aegypti and other mosquitoes, is 
likely to identify a number of genes whose regulatory elements will 
be useful in furthering our understanding of mosquito biology and 
control. 

Methods 

Mosquito rearing. Mosquitoes used in the experiments were of the Aedes aegypti 
Liverpool strain, the source strain for the reference genome sequence 49 . Mosquitoes 
were raised in incubators at 28°C with 70-80% humidity and a 12 hour light/dark 
cycle. Larvae were fed ground fish food {TetraMin Tropical Flakes, Tetra Werke, 
Melle, Germany) and adults were fed with 0.3 M aqueous sucrose. Adult females were 
blood fed three to five days after eclosion using anesthetized mice. All animals were 
handled in accordance with the guide for the care and use of laboratory animals as 
recommended by the National Institutes of Health and supervised by the local 
Institutional Animal Care and Use Committee. 

RNA extraction and reverse transcription of cDNA. Adult males, adult non-blood 
fed females, and adult females 48 hours post blood feeding, were dissected separately 
and respective testis or ovaries were removed from the carcasses. Total RNA was 
extracted from each carcass or germline tissue sample using the Ambion mirVana 
RNA isolation kit (Ambion/ Applied Biosystems, Austin, TX). Following extraction, 
RNA was treated with Turbo DNA-/ree (Ambion/Life Technologies, Grand Island, 
NY). Reverse transcription was preformed using AMV Reverse Transcriptase {New 
England BioLabs, Ipswich, MA) and oligo (dT) to capture polyadenylated RNA 
strands. cDNA was subsequently treated with RNase H (New England BioLabs, 
Ipswich, MA). All steps were carried out according to reaction conditions specified by 
the manufacturers. 

qRT-PCR. Expression levels were measured by qRT-PCR using SYBR Green I Master 
kit on a LightCycler 480 instrument (Roche Applied Sciences, Indianapolis, IN) with 
the following reaction conditions: initial denaturation at 95°C for 15 minutes, 
followed by 40 cycles of 95°C for 15 seconds, 58°C for 20 seconds, and 72 L 'C for 20 
seconds. The control gene, AAEL000951, was chosen due to its moderate and 
consistent expression of between 500-1000 RPKM across development 39 . Primers 
used for control gene AAEL000951 gene amplification were 951-F and 951-R. For 
gene specific eGFP amplification, the common eGFP primer used was eGFP-R, and 
gene specific primers were as follows: AAEL000923 923-F, AAEL004172 4172-F, 
AAEL010097 10097-F, AAEL007584 7584-F, AAEL007097 7097-F, Anopheles Vasa 
Avasa-F. We used the same lines we used for imaging for qRT-PCR analyses. All qRT- 
PCR reactions were performed in biological triplicate, using tissues from 10-15 
individuals per sample. Relative expression levels were quantified using the 2 _AACr 
method 50 . Independent one-tailed t-tests were performed to generate p values and test 
statistical significance between germline tissues and respective carcasses. Primer 
sequence information can be found in supplemental table 1. 

Plasmid construction. To construct the promoter constructs pMOS-3xp3-eGFP was 
digested and linearized with Fsel/Ascl and this was used as a backbone for construct 
assembly using the Gibson method 51 . For each promoter, we isolated a large (between 
a —2.1-3.1 kb) 5' promoter fragment, based on three parameters: 1) an optimal 
location for primer design, 2) include as much regulatory information as possible 
without venturing into other genes or transcribed features as identified through our 
expression browser (www.vector.caltech.edu), and 3) limit the size of our transgene to 
ensure integration events would occur at high frequency. Using these parameters, a 
2934 bp putative promoter region for gene AAEL000923 was PCR amplified from 
Aedes aegypti genomic DNA using primers 601A and 601B. A 950 bp eGFP-SV40 
reporter and terminator fragment was PCR amplified from pMOS-3xp3-eGFP using 
primers 601C and 601D. A 2133 bp putative promoter region for gene AAEL010097 
was PCR amplified from Aedes aegypti genomic DNA with primers 593A.2 and 593B. 
The 720 bp eGFP reporter was PCR amplified from pMOS-3xp3-eGFP using primers 
593C and 593D. The 1075 bp 3'-UTR for AAEL010097 was PCR amplified with 
primers 593E and 593F. A 3041 bp putative promoter region for gene AAEL007584 
was PCR amplified from Aedes aegypti genomic DNA with primers 592 A and 592B. A 



950 bp eGFP-SV40 reporter and terminator fragment was PCR amplified from 
pMOS-3xp3-eGFP using primers 592C and 602F. A 2500 bp putative promoter 
region for gene AAEL007097 was PCR amplified from Aedes aegypti genomic DNA 
with primers 591A and 591B. A 950 bp eGFP-SV40 fragment was PCR amplified 
from pMOS-3xp3-eGFP using the primers 591C and 602F shown above. These PCR 
products were assembled together, producing the final plasmids pMOS-3xp3-eGFP- 
AAEL000923, pMOS-3xp3-eGFP-AAEL010097, pMOS-3xp3-eGFP-AAEL007584 
and pMOS-3xp3-eGFP-AAEL007097. Primer sequence information can be found in 
supplemental table 1 . 

Generation of transgenic lines. Transgenic Ae. aegypti mosquitoes were created in 
the University of Maryland, College Park, Institute for Bioscience and Biotechnology 
Research's Insect Transformation Facility (http://www.ibbr.umd.edu/facilities/itf) by 
injecting preblastoderm embryos (Higgs white eye strain) with Mos-1 vector- 
containing plasmids (pMos) and plasmids expressing Mos-1 transposase 
(pKhspMos82) 52 in addition to a plasmid mixture for monitoring the quality of 
injections consisting of a minimal piggyBac vector (pBac QC) and a source of piggyBac 
transposase (phsp-Pbac) 53 . The injection mix of vectors and transposase expressing 
plasmids comprised 150 ng/microliter pMos, 200 ng/microliter pKhspMos82, 
50 ng/microliter pBac QC, and 200 ng/microliter phsp-Pbac. The injected embryos 
were hatched in deoxygenated H20 and the surviving adults were mated individually 
in 12oz containers at a ratio of 1 : 3 (injected: WT) to facilitate the identification of 
independent integration events, and each transgenic line established was maintained 
as an independent stock. For each construct three to four independent transgenic 
lines were obtained. In total we isolated 3 independent lines for pMOS-3xp3-eGFP- 
AAEL000923, 4 independent lines for pMOS-3xp3-eGFP-AAEL010097, 3 
independent lines for pMOS-3xp3-eGFP-AAEL007584, 4 independent lines for 
pMOS-3xp3-eGFP-AAEL007097, and 4 independent lines for Anophles -Vasa -eGFP. 
Most of the independent lines (15/18) obtained showed robust germline expression, 
and as expected this expression varied slightly in expression levels due to PEV. In 
more detail, 2/3 lines for pMOS-3xp3-eGFP-AAEL000923, 4/4 independent lines for 
pMOS-3xp3-eGFP-AAEL010097, 3/3 independent lines for pMOS-3xp3-eGFP- 
AAEL007584, 3/4 independent lines for pMOS-3xp3-eGFP-AAEL007097, and 3/4 
independent lines for Anophles-Vasa-eGFP each showed robust GFP expression in 
the germline. To ensure that these lines represented single chromosomal insertions, 
we backcrossed single individuals from each of the lines for four generations to our 
wildtype stock, and measured the Mendelian transmission ratios in each generation. 
In all cases, we observed a 50% transmission ratio in each generation, indicating that 
our strains represent insertions into single chromosomes. 

Microscopy. We imaged one representative line (a line whose expression of eGFP 
incorporated patterns observed in all transgenic lines for that construct) for each 
construct using heterozygous individuals. For each line we imaged 5-10 individuals 
and used the most representative images in figure 2. Dissected gonads were fixed in 
methanol-free 4% formaldehyde (Pierce) in 1 X phosphate buffer saline (PBS) for 
30 min and washed 3 times for 15 min in 0.1% Tween-20 PBS. The actin 
cytoskeletons was stained with phalloidin for 10 min and washed with PBS. Gonads 
were mounted on fresh slides containing Vectashield mounting medium with DAPI 
(Vectorlabs. Inc.) and cover slips. Multiplane z-stacks were collected by confocal 
analysis for testis, NBF and 24 hrs PBF ovaries. Single plane images were collected for 
samples of 72 hr PBF oocytes using wider aperture size and4X magnification. L1-L2 
larvae from transgenic strains were fixed for 30 min in methanol-free 4% 
formaldehyde (Pierce) and washed with 1 X PBS. Imaging was performed on a Leica 
M205C epifluorescence- dissecting microscope. 
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